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1 Introduction 

Active set ideas permeate traditional nonlinear optimization. Classical prob- 
lems involve a list of smooth nonlinear constraints: the active set for a particu- 
lar feasible solution — the collection of binding constraints at that point — is 
crucial in first and second order optimality conditions, in sensitivity analysis, 
and for certain algorithms. Contemporary interest in more general constraints 
(such as semidefiniteness) suggests a reappraisal. A very thorough modern 
study of sensitivity analysis in its full generality appears in [2j. Approaches 
more variational-analytic in flavor appear in texts such as |29j . Our aim here is 
rather different: to present a simple fresh approach, combining wide generality 
with mathematical elegance. 

Our approach has its roots in the notion of an "identifiable surface" [52] , 
and its precursors [10,6,5,4l[T51[TirH]. In essence, the idea is extremely simple: 
given a critical point x for a function /, a set M is identifiable if any sequence 
of points approaching x that is approximately critical (meaning corresponding 
subgradients approach zero) must eventually lie in M. The terminology comes 
from the idea that an iterative algorithm that approximates x along with an 
approximate criticality certificate must "identify" M. To take the classical 
example where / is a pointwise maximum of smooth functions, around any 
critical point x, assuming a natural constraint qualification, we can define M 
as those points with the same corresponding "active set" of functions attaining 
the maximum. 

Identifiable sets M are useful computationally because the problem of min- 
imizing the function / near the critical point x is equivalent to minimizing the 
restriction of / to M, which may be an easier problem, and because the iden- 
tifiability property allows convergent algorithms to find M — the motivation 
for active set methods. We show moreover how M is a natural tool for opti- 
mality conditions: under reasonable conditions, quadratic growth of / around 
x is equivalent to quadratic growth on M — a potentially easier condition to 
check. 

Clearly the smaller the identifiable set M, the more informative it is. Ideal 
would be a "locally minimal identifiable set" . We note that such sets may fail 
to exist, even for finite convex functions /. However, when a minimal identi- 
fiable set M does exist, we show that it is both unique (locally), and central 
to sensitivity analysis: it consists locally of all critical points of small linear 
perturbations to /. We show furthermore that, under reasonable conditions, 
variational analysis of / simplifies because, locally, the graph of its sub-differ- 
ential mapping is influenced only by the restriction of / to M. One appealing 
consequence is a close relationship between minimal identifiable sets and crit- 
ical cones appearing in the study of variational inequalities. 

The case when an identifiable set M is in fact a manifold around the point 
x (as in the classical example above) is particularly interesting. Remarkably, 
this case is equivalent to a powerful but seemingly stringent list of properties 
known as "partial smoothness" [19] . nondegeneracy and prox-regularity - 
related work on "VU algorithms" and "the fast track" appears in 20,2TJ[22] 
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and By contrast, our approach here is to offer a concise mathematical 

development emphasizing how this important scenario is in fact very natural 
indeed. 

The outline of the paper is as follows. In Section [21 we introduce the no- 
tion of identifiability for arbitrary set-valued mappings. Then in Section O wc 
specialize this idea to subdiffcrcntial mappings, laying the foundation for the 
rest of the paper. Section H] contains basic examples of identifiable sets, while 
Section [5] establishes a calculus of identifiability, which could be skipped at 
first reading. Arriving at our main results in Section [6j we study variational 
geometry of identifiable sets; this in particular allows us to establish a strong 
relationship between identifiable sets and critical cones in Section [7J Finally in 
Section[5J we consider optimality conditions in the context of identifiable sets, 
while in Section [S] we establish a relationship between identifiable manifolds 
and partial smoothness — one of our central original goals. In Section 1101 
results of Section [5] are generalized to a function setting. 

2 Identifiability in set-valued analysis 

A set-valued mapping G from R" to R m , denoted by G: R n =4 R m , is a 
mapping from R™ to the power set of R m . Thus for each point x £ R", G(x) 
is a subset of R m . The domain, graph, and range of G are defined to be 

domG := {x £ R" : G(x) + 0}, 
gphG := {(x,y) £ R" x R m : y £ G(x)}, 

rgeG:= (J G(x), 

respectively. Observe that domG and rgeG are images of gphG under the 
projections (x, y) i-> x and (x, y) i-> y, respectively. 

For two sets M and Q in R™, we will say that the inclusion M C Q 
holds locally around a point x, if there exists a neighborhood U of x satisfying 

Mnu cQnu. 

The key property that we explore in this work is that of finite identification. 

Definition 2.1 (Identifiable sets) Consider a mapping G: R" =t R m . We 

say that a subset M C R" is identifiable at x for v, where v £ G(x), if the 
inclusion 

gphG C M x R" holds locally around (x,v). 

Equivalently, a set M is identifiable at x for v £ G(x) if for any sequence 
(xi,Vi) — > (x,v) in gphG, the points Xi must lie in M for all sufficiently large 
indices i. Clearly M is identifiable at x, for v if and only if the same can be 
said of M PI dom G. Hence we will make light of the distinction between two 
such sets. 

Clearly domG is identifiable at x for v £ G(x). More generally, if v lies in 
the interior of some set U, then G _1 (C/) is identifiable at x for v. Sometimes 
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all identifiable subsets of domG arise locally in this way. In particular, one 
can readily check that this is the case for any set- valued mapping G satisfying 
G _1 o G = Id, an important example being the inverse of the projection map 
G = Pq 1 onto a nonempty, closed, convex set Q. 

The "smaller" the set M is, the more interesting and the more useful it 
becomes. Hence an immediate question arises. When is the identifiable set M 
locally minimal, in the sense that for any other identifiable set M' at x for v, 
the inclusion M C M' holds locally around xl The following notion will be 
instrumental in addressing this question. 

Definition 2.2 (Necessary sets) Consider a set-valued mapping G : R™ =3 
R m , a point x G R™, and a vector v G G(x). We say that a subset M c R n , 
containing x, is necessary at x for v if the function 



restricted to M, is continuous at x. 

Thus M is necessary at x for v S G(x) if for any sequence Xi —> x in M, 
there exists a sequence Vi G G(a;j) with Vi — > v. The name "necessary" arises 
from the following simple observation. 

Lemma 2.3 Consider a set-valued mapping G: R™ =t R"\ a point x G R™, 
and a vector v G G(S). Lef M and M' 6e <wo subsets o/R™. T/ien t/ie impli- 
cation 



The following elementary characterization of locally minimal identifiable 
sets will be used extensively in the sequel, often without an explicit reference. 

Proposition 2.4 (Characterizing locally minimal identifiability) 

Consider a set-valued mapping G: R™ =^ R m and a pair (x,v) G gphG. The 
following are equivalent. 

1. M is a locally minimal identifiable set at x for v. 

2. There exists neighborhood V of v such that for any subneighborhood W C V 
of v, the representation 



x i-> d(v, G(x)), 




holds. 



M = G 1 (W) holds locally around x. 



3. M is a locally maximal necessary set at x for v. 
4- M is identifiable and necessary at x for v. 
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Proof (1) => (2) : Suppose M is a locally minimal identifiable set at x for v. 
Then there exists a neighborhood V of v satisfying 

G^iy) C M locally around x. 

In turn, the set G _1 (X^) is identifiable at x for v. Consequently, by local min- 
imality of M we deduce the equality M = G _1 (V) locally around x. Observe 
that the same argument is valid for any subneighborhood W C V of v. The 
result follows. 

(2) (3) : We first prove that M is necessary at x for v. Consider any 
sequence Xi — ¥ x in M and, for the sake of contradiction, suppose that there 
do not exist vectors Vi € G(xt) with Vi — > v. Then there exists a real number 
e > such that, restricting to a subsequence, we have G(xi) tl B e (v) = 0. Con- 
sequently, we obtain Xi £ G _1 (F D B € (v)) for all large indices i. We conclude 
that G _1 (V n B t (v)) and M do not coincide on any neighborhood of x, which 
is a contradiction. Hence M is necessary at x for v. 

Now consider any other necessary set M' at x for v and a sequence Xi — > x 
in M'. By necessity of M', there exist vectors Vi £ G(xi) converging to v. By 
the defining property of M, we deduce Xi € M for all large indices i. Hence 
the inclusion M' C M holds locally around x. We conclude that M is a locally 
maximal necessary set at x for v. 

(3) (4) : Consider any sequence (xi,Vi) — > (x,v) in gphG. Observe that 
the set 

oo 

M' := M\j{j{ Xi \, 

i=l 

is necessary at x for 5. Consequently, by local maximality of M, we have 
Xi G M for all large indices i. Thus M is identifiable at S for v. 

(4) ^> (1) : This is immediate from Lemma \2. 31 

Remark 2.5 It is clear from Proposition 12.41 that whenever locally minimal 
identifiable sets exist, they are locally unique. That is, if Mi and M-i are both 
locally minimal identifiable sets at x for v £ G(x), then we have Mi = M2 
locally around x. 

The central goal in sensitivity analysis is to understand the behavior of 
solutions x, around 5, to the inclusion 

v e G(x), 

as v varies near v. Characterization 2 of Proposition 12.41 shows that a locally 
minimal identifiable set at x for v is a locally minimal set that captures all the 
sensitivity information about the inclusion above. 

This characterization yields a constructive approach to finding locally min- 
imal identifiable sets. Consider any open neighborhoods Vi D V2 D V3 D . . . , 
around v with the diameters of Vi tending to zero. If the chain 



G-^Vi) D G-\V 2 ) D G- 1 ^) D . . . , 
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stabilizes, in the sense that for all large indices i and j, we have G _1 (T / i) = 
G _1 (Vj) locally around x, then G _1 (t^) is a locally minimal identifiable set 
at x for v, whenever i is sufficiently large. Moreover, the locally minimal iden- 
tifiable set at x for v, if it exists, must arise in this way. 

The following example shows that indeed a set-valued mapping can easily 
fail to admit a locally minimal identifiable set. 

Example 2.6 (Failure of existence) Consider the mapping G: R 2 =t R, 
defined in polar coordinates, by 



Let x be the origin in R 2 and v := £ G(x). Observe that for e — > 0, the 
preimages 



never coincide around x. Consequently, there is no locally minimal identifiable 
set at x for v. 

Notwithstanding the previous example, locally minimal identifiable sets do 
often exist. Before proceeding, we recall the following two notions of continuity 
for set- valued mappings. 

Definition 2.7 (Continuity) Consider a set-valued mapping F : R" =4 R"\ 

1. F is outer semicontinuous at a point x £ R™ if for any sequence of points 
Xi £ R™ converging to x and any sequence of vectors Vi £ F(xi) converging 
to v, we must have v £ F(x). 

2. F is inner semicontinuous at x if for any sequence of points Xi converging 
to x and any vector v £ F (x), there exist vectors Vi £ F(xi) converging to 
v. 

If both properties hold, then we say that F is continuous at x. We will say 
that F is inner-semicontinuous at x, relative to a certain set Q C R", if the 
condition above for inner- semicontinuity is satisfied for sequences Xi — s> x in 



Clearly inner-semicontinuous mappings always admit locally minimal iden- 
tifiable sets. 

Proposition 2.8 (Identifiability under continuity) 

Consider a set-valued mapping G: R™ =4 R m that is inner semicontinuous, 
relative to dom G, at a point x £ domG. Then domG is a locally minimal 
identifiable set at x for any vector v £ G(x). 

More interesting examples can be constructed by taking pointwise unions 
of maps admitting locally minimal identifiable sets. 




G~\-e,e) = {(r,8) £ R 2 : G(r,6)n(-e,e 



)^0} = {M):|fl|< e } 



Q- 
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Proposition 2.9 (Pointwise union) Consider a finite collection of outer- 
semicontinuous mappings, Gi : R n R m , for i = 1, . . . , k. Define the point- 
wise union mapping G : R" =4 R m to be 

m 

G{x) = |J Gi(x). 
i=i 

Fix a point x £ R" and a vector v £ G(x), and suppose that for each index 
i, satisfying v 6 Gi{x), there exists a locally minimal identifiable set Mi (with 
respect to Gi) at x for v. Then the set 

M := |J Mi, 

i:v£Gi(x) 

is a locally minimal identifiable set (with respect to G) at x for v. 

Proof This readily follows from Proposition ^. 41 

In particular, locally minimal identifiable sets exist for piecewise polyhedral 
mappings. These are those mappings whose graphs can be decomposed into a 
union of finitely many convex polyhedra. 

Example 2.10 (Piecewise polyhedral mappings) 

Consider a piecewise polyhedral mapping G: R n =4 R m , where gphG = 
Ui=i Vi an d Vi C R™ are convex polyhedral sets. It is easy to check that set- 
valued mappings whose graphs are convex polyhedral are inner-semicontinuous 
on their domains. Fix a point x € R" and a vector v £ G(x), and let 
7r: R" x R m — > R™ be the canonical projection onto R". Consequently, by 
Propositions 12.81 and 12.91 the set 

U «w 

is a locally minimal identifiable set at x, for v. 

For the remainder of the current work, we will be investigating the notion 
of identifiability in the context of the workhorse of variational analysis, the 
subdifferential mapping (Definition 13 - 5|) . 

3 Identifiability in variational analysis 

3.1 Preliminaries from variational analysis 

In this section, we briefly summarize some of the fundamental tools used 
in variational analysis and nonsmooth optimization. We refer the reader to 
the monographs Borwein-Zhu [3], Clarke-Ledyaev-Stern-Wolenski 7], Mor- 
dukhovich [2"3"ll24] . and Rockafellar-Wets [3U], for more details. Unless oth- 
erwise stated, we follow the terminology and notation of [50] . 
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The functions that we will be considering will take their values in the 
extended real line R := RU {—00,00}. We say that an extended-real- valued 
function is proper if it is never {—00} and is not always {+00}. 

For a function / : R" — > R, the domain of / is 

dom/ := {x £ R" : f(x) < +00}, 

and the epigraph of / is 

epi/ := {(x,r) £ R" x R : r > f(x)}. 

We will say that / is lower- semicontinuous (Isc for short) at a point x provided 
that the inequality liminf^s f(x) > f(x) holds. If / is lower-semicontinuous 
at every point, then we will simply say that / is lower-semicontinuous. 

Throughout this work, we will only use Euclidean norms. Hence for a point 
x £ R™, the symbol \x\ will denote the standard Euclidean norm of x. Given 
a set Q C R" and a point x £ Q, we let o(\x — x\) for x £ Q be shorthand for 

a function that satisfies °^ x ~~ x ^ — ^ whenever x % x with x ^ x. 

\x—x\ 1 

For a set Q C R" and a point x £ R™ , the distance of x from Q is 

d Q {x) := utf \x-y\, 

and the projection of x onto Q is 

P Q {x) :={y £ Q:\x- y\= d Q (x)}. 

Normal cones are fundamental in variational geometry. The most intuitive 
type of a normal cone arises from the metric projection. 

Definition 3.1 (Proximal normals) Consider a set Q C R" and a point 
x £ Q. The proximal normal cone to Q at x, denoted Nq(x), consists of 
all vectors v £ R™ such that x £ Pq(x + ^v) for some r > 0. In this case, 
{x} = Pq(x + -p-v) for each real number r' > r. 

Geometrically, a vector v ^ is a proximal normal to Q at x precisely 
when there exists a ball touching Q at x such that v points from x towards 
the center of the ball. Furthermore, this condition amounts to 

(v, x — x) < 0(\x — x\ 2 ) as x — > x in Q. 

Relaxing the inequality above, one obtains the following notion. 

Definition 3.2 (Frechet normals) Consider a set Q C R™ and a point x £ 
Q. The Frechet normal cone to Q at x, denoted Nq(x), consists of all vectors 
v £ R™ such that 

(v, x — x) < o(\x — x\) as x — > x in Q. 
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Note that both Nq(x) and Nq(x) are convex cones, while Nq(x) is also closed. 

For a set Q G R", the set-valued mapping x h4 Nq(x) is not outer- 
semicontinuous, and hence is not robust relative to perturbations in x. To 
correct for that, the following definition is introduced. 

Definition 3.3 (Limiting normals) Consider a set Q C R" and a point 
x G Q. The limiting normal cone to Q at x, denoted Nq(x), consists of all 

vectors v G R" such that there are sequences Xi -3- x, and i>i — > v with 

The limiting normal cone, as defined above, consists of limits of nearby 
Frechet normals. In fact, the same object arises if we only allow limits of 
nearby proximal normals. An important and favorable situation arises when 
the Frechet and limiting constructions coincide. 

Definition 3.4 (Clarke regularity of sets) A set Q C R" is said to be 
Clarke regular at a point x € Q if it is locally closed at x and every limiting 
normal vector to Q at x is a Frechet normal vector, that is Nq(x) = Nq(x). 

We can study variational properties of functions by means of normal cones 
to their epigraphs. 

Definition 3.5 (SubdifFerentials) Consider a function /: R" — > R and a 

point x G R" where / is finite. The proximal, Frechet, and limiting subdifFer- 
entials of / at x, respectively, are defined by 



dpf(x) 


= {v 


e R n 


(v, 


-1) 


£N c p pif (x,f(x))}, 


df(x) 


= {v 


G R" 


(v, 


-1) 


G N cpif (x,f(x))}, 


df(x) 


= {v 


e R" 


(v, 


-1) 


G N cpif (x,f(x))}, 



while the horizon subdifferential is defined by 

d°°f(x) ={»£ R" : (v, 0) G iVepi f (x, /(*))}. 

For x such that f(x) is not finite, we follow the convention that dpf(x) = 
8 f{x) = 8f(x) = 8°° f(x) = 0. 

For convex functions /, the subdifferentials d p f, df, and df reduce to 
the classical convex subdifferential, while for smooth / they coincide with 
the gradient mapping V/. See for example [301 Exercise 8.8]. In this sense, 
these three subdifFerentials generalize the classical gradient. The horizon sub- 
differential plays an entirely different role; it records horizontal normals to 
the epigraph of the function and is instrumental in establishing subdifferential 
calculus rules. See [301 Theorem 10.6]. 

Definition 3.6 (Clarke regularity of functions) A function /: R™ — > R 
is said to be Clarke regular at a point x G R" if the epigraph epi / is Clarke 
regular at (x, fix)). 
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Given any set Q C R™ and a mapping F: Q — >■ Q, where Q C R m , we say 
that F is C p -smooth (p > 1) if for each point x G Q, there is a neighborhood 
U of x and a C p mapping F : R™ —> R m that agrees with F on Q n t/. 

Definition 3.7 (Smooth Manifolds) We say that a set M C R™ is a C p - 

submanifold of dimension r if for each point x G M, there is an open neigh- 
borhood U around X and a function F: U —> R"~ r that is C p -smooth with 
WF(x) of full rank and satisfying M D U = {x G U : F(x) — 0}. In this case, 
we call F a local defining function for M around x. 

A good source on smooth manifold theory is |18j . 

Theorem 3.8 ( |30L Example 6.8]) Any C 1 -manifold M is Clarke regular 
at every point x € M and the normal cone Nm (x) is equal to the normal space 
to M at x, in the sense of differential geometry. 

We will have occasion to use the following simple result. 

Proposition 3.9 Consider a set M C R" and a function f : R" — > R that is 
finite-valued and C 1 -smooth on M . Then, at any point x € M, we have 

df(x)GVg(x)+N M (x), 

where g: R™ — > R is any C 1 -smooth function agreeing with f on a neighbor- 
hood of x in M . 

Proof Define a function h : R™ — > R agreeing with / on M and equalling plus 
infinity elsewhere. We successively deduce 

Bf(x) C dh(x) = d(g(-) + 6 M (-))(x) = Vg{x) + N M (x), 

as we need to show. 

For a set Q C R", we let 5q denote a function that is on Q and +oo 
elsewhere; we call Sq the indicator function of Q. Then for any point x e Q, 
we have Nq(x) — dp8Q(x), Nq(x) = 85q(x) and Nq(x) = 8Sq(x). 

3.2 Identifiability in variational analysis 

We are now ready to define the appropriate notion of identifiability in the 
context of optimization. 

Definition 3.10 (Identifiability for functions) 

Consider a function / : R" — > R, a point x S R™, and a subgradient v G df(x). 
A set M C R" is identifiable at x for v if for any sequences (xi, f(xi),Vi) —> 
(x, f(x), v), with Vi G df(xi), the points Xi must all lie in M for all sufficiently 
large indices i. 
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The definition above can be interpreted in the sense of Section [5] Indeed, 
consider a function /: R™ — > R and a subgradient v £ df(x), for some point 
x £ R™. Define the set- valued mapping 

G: R" =)RxR", 

x i } {f(x)} x df(x). 

Then M is identifiable (relative to /) at x for v if and only if it is identifiable 
(relative to G) at x for the vector (f(x),v). Here, we have to work with the 
mapping G, rather than the subdifferential mapping df directly, so as to facil- 
itate coherence between normal cone mappings and subdifferential mappings 
via epigraphical geometry. (See Proposition 13. 161 ) This slight annoyance can 
be avoided whenever / is subdiffcrcntially continuous at x for v. 

Definition 3.11 (Subdifferential continuity) A function /: R™ — s> R is 
subdifferentially continuous at x for v £ df(x) if for any sequences Xi — > x and 
Vi —¥ v, with Vi £ df(xi), it must be the case that f(xf) — > f(x). 

Subdifferential continuity of a function / at x for v was introduced in 
Pol Definition 1.14], and it amounts to requiring the function (x,v) i-t f(x), 
restricted to gph9/, to be continuous in the usual sense at the point (x,v). 
In particular, any lower-semicontinuous convex function is subdifferentially 
continuous [301 Example 13.30]. 

Similarly, we define necessary sets as follows. 

Definition 3.12 (Necessity for functions) Consider a function /: R" — } 
R. A set M C R n is necessary at x for v £ df(x) if both the function / and 
the mapping 

x ^ d(v,df(x)), 

restricted to M, are continuous at x. 

Specializing the characterization in Proposition 12.41 to this setting, we ob- 
tain the following. 

Proposition 3.13 (Characterizing locally minimal identifiability) 

Consider a function f : R" — > R, a point x £ R n , and a subgradient v £ df(x). 
Then the following are equivalent. 

1. M is a locally minimal identifiable set at x for v, 

2. There exists a neighborhood V of v and a real number e > such that 
for any subneighborhood W C V of v and a real number < e' < e, the 
presentation 

M = [df)- l {W) n {x £ R™ : \f(x) - f(x)\ < e'} holds locally around x. 

3. M is a locally maximal necessary set at x for v. 
4- M is identifiable and necessary at x for v 
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Definition 3.14 (Identifiability for sets) Given a set Q C R n , we will say 

that a subset M c Q is identifiable (relative to Q) at x for v G Nq(x) if M 
is identifiable (relative to Sq) at x for w € 85q(x). Analogous conventions will 
hold for necessary sets and locally minimal identifiable sets. 

It is instructive to observe the relationship between identifiability and the 
metric projection in presence of convexity. 

Proposition 3.15 (Identifiability for convex sets) 

Consider a closed, convex set Q and a subset M C Q. Let x G M and v G 
Nq(x). Then the following are equivalent. 

1. M is identifiable (relative to Q) at x for v. 

2. M is identifiable (relative to Pq 1 ) at x for x + v. 

Analogous equivalence holds for necessary sets. 

Proof Suppose that M is identifiable (relative to Q) at x for v. Consider a se- 
quence (xi,yi) — > (x,x + v) in gph Pq 1 ■ Observe Xi = PQ{y%) and the sequence 
yi— Xi G Nq(xi) converges to v. Consequently, the points Xi all eventually lie 
in M. 

Conversely suppose that M is identifiable (relative to Pq 1 ) at x for x + v. 
Consider a sequence (xi,Vi) — > (x,v) in gphA^Q. Then the sequence (xi,Xi + 
Vi) G gphPg 1 converges to (x,x + v). Consequently, we have x,- L G M for all 
large i. 

We leave the verification of the analogous equivalence for necessary sets to 
the reader. 

Thus a subset M of a closed, convex set Q is identifiable at x for v G Nq(x) 
if and only if the equality, Pq = Pm, holds locally around x + v. 

The following simple proposition establishes epigraphical coherence, al- 
luded to above, between normal cone mappings and subdifferential mappings 
in the context of identifiability. 

Proposition 3.16 (Epigraphical coherence) 

Consider a function f : R™ — > R and a subgradient v G df{x), for some point 
x G R™. Then M C dom/ is an identifiable set (relative to f) at x for v if 
and only i/gph/| M is an identifiable set (relative to epi/j at (x,f(x)) for 
(v, —1). Analogous statements hold for necessary, and consequently for locally 
minimal identifiable sets. 

4 Basic Examples 

In this section, we present some basic examples of identifiable sets. 

Example 4.1 (Smooth functions) If U is an open set containing x and 
/: U — > R is C 1 -smooth. Then U is a locally minimal identifiable set at x for 
V/Or). 
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Example 4.2 (Smooth manifolds) If M is a C 1 manifold, the M is a lo- 
cally minimal identifiable set at any x G M for any v G Nm{x). This follows 
immediately by observing that the normal cone mapping x <-> Nm (x) is inner- 
semicontinuous on M. 

We define the support of any vector v G R™, denoted suppv, to be the set 
consisting of all indices i G {1, . . . , n} such that u,; ^ 0. The rank of v, denoted 
rankv, is then the size of the support suppu. 

Example 4.3 (Convex polyhedra) Let Q C R™ be a convex polyhedron. 
Example 12.101 shows that M := Nq 1 ^) (equivalently, M := argmax^Q^, x)) 
is a locally minimal identifiable set at x for v. 

More concretely, suppose that Q has the representation 

Q = {x G R™ : (a t ,x) < h for all i G /}, (4.1) 

for some index set I — {1, . . . , m} and vectors Oj., . . . , a m G R™ and b G R m . 
For any point x G R", define the active index set 

I(x) := {i G I : (ffli,x) = h}. 

Then we have Nq(x) = conejai : i G I(x)} and consequently there exist 
multipliers A G R™ with supp A G I(x) satisfying v = Ylieifm) ^i a i- Hence for 
any point y G Q, the equivalence 

i£l(x) iel(x) 
i£/(x) 

holds. We deduce that M has the alternate description 

M = {x G Q : supp A C I(x)}. 

We should note that under a strict complementarity condition, v G riA r g(x), 
we may choose A with supp A = I(x). Then M would consist of all points 
x G Q, whose active index set I(x) coincides with I(x). It is then standard to 
check that M coincides with an affine subspace locally around x. 

Example 4.4 (Polyhedral functions) Analogously, we may analyse a con- 
vex polyhedral function / : R™ — > R, a function whose epigraph is a convex 
polyhedron. To be more precise, we may express / as 

f f\_( nia,Xi e j{{ai,x) + bt] whenever (cj, x) < dj for all j G J, 
[ oo otherwise, 

for some index sets / = {1, . . . , m} and J = {1, . . . , k}, vectors <Zj, Cj G R", 
and real numbers hi, dj for i G I and j G J. For any point x G R", define the 
active index sets 



I{x) = {i€l: (a,i,x) + h = f(x)}, 
J{x) = {j G J : (cj,x) = dj}. 
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A straightforward computation shows 

df(x) = convjai : i G I(x)} + conejcj : j £ J{x)}. 

Consider a pair (x, v) E gphdf. Then there exist multipliers (A, p.) £ R" 1 x R^ 
satisfying 

iei je.J 

with X^iei 1 = ^' SU PP^ C -T(S), and supp/i C J(x). Applying the same 
argument as in Example 14.31 to epi/, we deduce that the set 

M = {x € dom/ : suppA C I(x), supp/i C J{x)}, 

is a locally minimal identifiable set at x for v. Again we should note that a 
particularly nice situation occurs under a strict complementarity condition, 
v E ridf(x). In this case there exist multipliers (A, /u) so that suppA = I(x) 
and supp /2 = J(x), and then M coincides with an afhne subspace locally 
around x. 

Example 4.5 (Maximum function) In particular, consider the maximum 
function mx: R" — > R, defined by 

mx(x) := max{a;i, . . . , x n }. 

Then given a point x and a vector v € d(mx)(x), the set M = {x 6 R" : 
suppw C I(x)}, where 

I{x) :— {i : Xi — mx(i)}, 

is a locally minimal identifiable set at x for v. Alternatively, M admits the 
presentation 

M = {x G R" : multmx(x) > rankw} locally around x, 
where multmx(x) simply denotes the size of the set I{x). 

Generalizing beyond polyhedrality, we now consider the so-called piecewise 
linear- quadratic functions; these are those functions whose domain can be rep- 
resented as the union of finitely many convex polyhedra, so that the function is 
linear or quadratic on each such set. Convex piecewise linear-quadratic func- 
tions are precisely the convex functions whose subdifferential mappings are 
piecewise polyhedral [25] . 

Proposition 4.6 (Piecewise linear-quadratic functions) 

Consider a convex, piecewise linear- quadratic function f: R" — > R. Then there 
exists a locally minimal identifiable set at any point x € dom / for any vector 
V S df(x). 
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Proof Convex piecewise linear-quadratic functions have piecewise polyhedral 
subdifferential mappings [35]. Consequently, Example 12.101 shows that the 
mapping x M> df(x) admits a locally minimal identifiable set at any point 
x G R" for any vector v G df(x). Since piecewise linear-quadratic functions 
are lower-semicontinuous |30l Proposition 10.21], and lower-semicontinuous 
convex functions are subdiffcrcntially continuous 30, Example 13.30], the re- 
sult follows. 

We now briefly consider the three standard convex cones of mathematical 
programming. 

Example 4.7 (Non-negative Orthant) Consider a point x € R" and a 
vector v G JVr," (x). Then M :— {x G R" : x t = for each i G suppv} is a 
locally minimal identifiable set at x for v. Observe that M also admits the 
presentation 

M = {x G R" : ranka; + rank?; < n} locally around x. 

Example 4.8 (Lorentz cone) Consider the Lorentz cone 

£" {( x , r ) G R" x R : r > \x\}. 

Observe that £" coincides with the epigraph epi | • | . Let x — and consider 
any v G d\ ■ |(0) with \v\ = 1. Then for any real e > 0, the set M e :— {x G R" : 
(]fyjv) < e} is identifiable at x for v. In particular, for n > 2 and e ^ e' the 
sets M e and M e i do not coincide on any neighborhood of 5, and consequently 
there is no locally minimal identifiable set at x for v. 

In what follows S™ will denote the space of n x n real symmetric matrices 
with the trace inner product while S" will denote the convex cone of symmetric 
positive semi-definite matrices. With every matrix X G S™ we will associate 
its largest eigenvalue, denoted by Xi(X). The multiplicity of Xi(X) as an 
eigenvalue of X will be written as mult \i(X). Finally M™ xm will denote the 
space of n X m matrices with real entries. We defer the verification of the 
following two examples to a forthcoming paper [9]. We should also emphasize 
the intriguing parallel between these two examples and Examples 14.51 and 14.71 

Example 4.9 (Positive semi-definite cone) Consider a matrix X G S" 
and a normal V G N S ™(X). Then 

M = {X G S T l : r&nkX + rank? < n}, 

is an identifiable set at X for V. It is interesting to note that M may fail to 
be locally minimal in general. Indeed, it is possible that S™ admits no locally 
minimal identifiable set at X for V. This can easily be seen from the previous 
example and the fact that and C 2 are isometrically isomorphic. 

However, under the strict complementarity condition V G riiVs«(X), we 
have T&nkX + rank? = n, and consequently M coincides with {X G S™ : 
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rankJT = rankX} around X. It is then standard that M is an analytic man- 
ifold around X, and furthermore one can show that M is indeed a locally 
minimal identifiable set at X for V. For more details see [9]. 

Example 4.10 (Maximum eigenvalue) Consider a matrix X and a sub- 
gradient V G dX\(X), where Ai : S n — > R is the maximum eigenvalue function. 
Then 

M := {IeS": mult Ai(X) > rankF}, 

is an identifiable set at X for V. Again under a strict complementarity con- 
dition V G ridXi(X), we have r&nkV — multAi(X), and consequently M 
coincides with the manifold {X £ S n : multAi(AL) = multAi(A > )} locally 
around X. Furthermore under this strict complementarity condition, M is 
locally minimal. For more details see [5]. 

Example 4.11 (The rank function) Consider the rank function, denoted 
rank : M" xm R. Then 

M := {X G M nxm : rankX = rankX} 

is a locally minimal identifiable set at X for any V € 9(rank)(A > ). To see this, 
observe that the equality 

epirank = epi (rank X + 5m) holds locally around (X, rank X). 

Combining this with the standard fact that M is an analytic manifold verifies 
the claim. 



In Examples 14.81 and 14.91 we already saw that there are simple functions 
/ : R™ — > R that do not admit a locally minimal identifiable set at some point 
x for v 6 df(x). However in those examples v was degenerate in the sense 
that v was contained in the relative boundary of df(x). We end this section 
by demonstrating that locally minimal identifiable sets may, in general, fail 
to exist even for subgradients v lying in the relative interior of the convex 
subdifferential df(x). 

Example 4.12 (Failure of existence) 

Consider the convex function / : R 2 — > R, given by 



f(x,y) = \/x 4 + y 2 . 
Observe that / is continuously differentiable on R 2 \ {(0,0)}, with 

a; 4 + y z 

and 

9/(0, 0) = {0}x [-1,1]. 

We claim that / does not admit a locally minimal identifiable set at (0, 0) 
for the vector (0,0) G 9/(0,0). To see this, suppose otherwise and let M be 
such a set. 
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Consider the curves 

L n := {(x, y)eR 2 :p ^-a: 2 }, 

parametrized by integers n. For a fixed integer n, consider a sequence of points 
(xuVi) -> (0,0) in L n . Then 



lim \Vf(xi,yi)\ 



n 2 



n 4 + f 



Since M is necessary at (0,0) for (0,0), we deduce that for each integer n, 
there exists a real number e n > such that 

B e „nL„nM = {(o,o)}. 

2 

However observe linin^oo = 0. Therefore we can choose a sequence 
(x n ,y n ) G B £?i nL„, with (x„, y„) ^ (0,0), (x n ,y n ) -> (0,0), and the gradients 
V/(a;„,j/„) tending to (0,0). Since M is identifiable at (0,0) for (0,0), the 
points (x n ,y n ) lie in M for all large indices n, which is a contradiction. 



5 Calculus of identifiability 

To build more sophisticated examples, it is necessary to develop some calculus 
rules. Our starting point is the following intuitive chain rule. 

Proposition 5.1 (Chain Rule) Consider a function f(x) :— g(F(x)) de- 
fined on an open neighborhood V C R™, where F : V —> R m is a C 1 -smooth 
mapping and g : R™ — > R is a Isc function. Suppose that at some point x £ 
dom/ ; the qualification condition 

kcr VF(x)* n d°°g(F(x)) = {0}, (5.1) 
is valid, and hence the inclusion 

df(x) C VF(x)*dg(F(x)) holds. 

Consider a vector v e df(x) and the corresponding multipliers 

A := {y e dg(F(x)) : v = VF(x)*y}. 

Suppose that for each vector y E A, there exists an identifiable set M y (with 
respect to g) at F(x) for y. Then the set 

M:=\jF-\M y ), 

yeA 

is identifiable (with respect to f) at x for v. 
If, in addition, 

— g is Clarke regular at all points in domg around F(x), 
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— the collection {M y } y ^s is finite, and 

— each set M y is a locally minimal identifiable set (with respect to g) at F(x) 
for y, 

then M is a locally minimal identifiable set (with respect to f) at x for v. 

Proof We first argue the identifiability of M. To this effect, consider any se- 
quence (xi, f(xi),Vi) — y (x, f(x), v), with Vi G df(xi). It is easy to see that the 
transversality condition 

ker VF(xi)* n d°°g(F(xi)) = {0}, (5.2) 

holds for all sufficiently large indices i. Then by [301 Theorem 10.6], we have 

Vi G df{xi) C VF{xiydg{F{ Xi )). 

Choose a sequence G dg(F(xi)) satisfying v% = V F{xi)*yi. We claim that 
the sequence y\ is bounded. Indeed suppose otherwise. Then restricting to a 
subsequence, we can assume \yi\ — > oo and y*^ — > y, for some nonzero vector 
y E d°°g(F(x)). Consequently 



VF(x)*y = lim VFfi,)*^ = lim = 0, 

thus contradicting (|5.2[) . 

Now restricting to a subsequence, we may suppose that the vectors yi £ 
dg(F(xi)) converge to y for some vector y £ dg(F(x)). Furthermore, observe 
y € A. So for all sufficiently large indices i, the points F(xi) all lie in M s . 
Consequently the points x\ lie in M for all large indices i, and we conclude 
that M is identihable (with respect to /) at x for v. 

Now suppose that g is Clarke regular at all points of dom g near F(x), the 
collection {Myj^gs is finite, and each set M y is a locally minimal identifiable 
set (with respect to g) at F(x) for y. We now show that M is necessary (with 
respect to /) at x for v. To this effect, consider a sequence xi — >• 5 in M. 
Then restricting to a subsequence, we may suppose that the points F(xi) all 
lie in M y for some y G A Consequently there exists a sequence G dg(F(xi)) 
converging to y. Hence we deduce 

Vi := VF( Xi )* yi -> VF(i)^" = B. 

Since g is Clarke regular at all points of domg near F(x), by [301 Theorem 
10.6], the inclusion Vi G df(xi) holds for all large i. Hence M is necessary 
(with respect to /) at x for v. 



Our goal now is to obtain a sum rule. The passage to this result though the 
chain rule is fairly standard. The first step is to deal with separable functions. 
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Proposition 5.2 (Separable functions) 

Consider proper, Isc functions fi : R Ki — > R, for i = 1, . . . , k, and define 

k 

f(X!,...,X k ) =^2fi{Xi). 

i=l 

Suppose that M Si C R ni is an identifiable set (with respect to fi) at Xi for 
Vi G dfi(xi), for each i = 1, . . . , k. Then the set 

M := M Sl x ... x M Sfc , 

is identifiable (with respect to f) at x — (x\, . . . ,Xk) for v = (vi, . . . ,Vk)- An 
analogous result holds for necessary sets. 

Proof Clearly M := M Sl x . . . x M Cfc is identifiable for the set- valued mapping 

k 

(x!,...,x k ) ' ^ JJ{/»(a;i)} x dfi(xi), 

i=l 

at x for Yli=i (Ji{x-i),Vi). Furthermore lower-semicontinuity of the functions fi 
readily implies that M is also identifiable for 

k 

(xi,...,x k ) i-> {/(z)} x Y[dfi(xi), 

i=l 

at x for (f(x), v). Using the identity df(x\, . . . , Xk) — IliLi ®fi( x i)i we deduce 
the result. The argument in the context of necessary sets is similar. 

Corollary 5.3 (Sum Rule) Consider proper, Isc functions fi: R" — > R, for 
i = l,...,k, and define the sum f(x) = Yli=ifi( x )- Assume at some point 
x G dom /, the qualification condition 

k 

Vi = and Vi G df°°(x) for each i => Vi = for each i. 

i=l 

Consider a vector v G df(x) and define the set 

k k 

A = {(«!, . . . , v fc ) € IJ : t; = ^ «<}■ 

i=l i=l 

For eac/i (t;i, . . . , v k ) G A, let M Vi be an identifiable set (with respect to fi) at 
x for Vi . Then 

M:= (J M Vl n . . . n M Vk , 

(vi,...,v k )eA 

is identifiable (with respect to f) at x for v. 
If, in addition, 
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— each fi is Clarke regular at all points in dom/i around x, 

— the collection {M Vl x . . . x M Vk }r Vlt . Vh )^A * s finite, and 

— for each (v±,..., Vk) G A, the set M Vi is a locally minimal identifiable set 
( with respect to fi) at x for Vi , 

then M is a locally minimal identifiable set (with respect to f) at x for v. 

Proof We may rewrite / in the composite form go F, where F(x) :— (x, . . . , x) 
and g{x\, . . . ,Xk) '■= S»=i fi( x i) i s separable. Then applying Proposition 15.11 
and Proposition 15.21 we obtain the result. 



In particular, we now obtain the following geometric version of the chain 
rule. 

Proposition 5.4 (Sets with constraint structure) 

Consider closed sets Q £ R" and K S R™, and a C 1 smooth mapping 
F : R" -> R m . Define the set 

L = {xeQ: F(x) G K}. 

Consider a pair (x,v) £ gphA^ and suppose that the constraint qualification 



y e N Q (x),we N K (F(x)) 
y + \7F(x)*w = 



(y,w) = (0,0), 



holds. Define the set 

A = {(y,w) e N Q (x) x N K (F(x)) : y + \7F(x)*w = v}, 

and for each pair (y,w) G A, let M y be an identifiable set (relative to Q) at x 
for y and let K w be an identifiable set (relative to K) at F(x) for w. Then 

M:= (J M v nF-\K w ), 

is identifiable (relative to L) at x for v. 
If, in addition, 

— Q ( respectively K) is Clarke regular at each of its point near x ( respectively 

— the collection {M y x K w }^ V:W ^a is finite, 

— for each (y,w) G A, the set M y (respectively K w ) is a locally minimal 
identifiable set with respect to Q (respectively K) at x for y (respectively 
at F{x) for w ), 

then M is a locally minimal identifiable set (relative to L) at x for v. 

Proof Observe 5l = 5q + Sp-iijcy Combining Proposition 15.11 and Corol- 
lary [531 we obtain the result. 
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Corollary 5.5 (Max-type functions) 

Consider C^-smooth functions fi : R™ — > R, for i G I := {1, . . . , m}, and let 
f(x) :— max{/i(a;), f%{x), . . . , f m (x)}. For any x G R", define the active set 

I(x) = {i G I : f(x) = fi(x)}. 

Consider a pair (x, v) G gph<9/, and the corresponding set of multipliers 

yl = {AeR m :5= X i V M%), suppAc/(x)}. 

iei(s) 

Then 

M — |J {x G R™ : supp A C I(x)}, 

is a locally minimal identifiable set (relative to f) at x for v. 

Proof This follows directly from Proposition 15.11 and Example 14.41 by writing 
/ as the composition mx o F, where F(x) — (f\(x), . . . , f m (x)). 

Corollary 5.6 (Smooth constraints) 

Consider C 1 -smooth functions gi \ R™ — > R, for i £ I := {l,...,m} 7 and 
define the set 

Q = {x £ R™ : Qi(x) < for each i 6 /}. 
For any x G R™, define the active set 

I(x) = {i G / : gi (x) = 0}. 

and suppose that for a certain pair (x, v) G gph Nq, the constraint qualification 

Xi"Vgi(x) = and \ > for all i G I(x) A.; = for all i G I(x), 

iei(s) 

holds. Then in terms of the Lagrange multipliers 

A := {A G R m : v = ^ Wg l {x) 1 supp A C I(x)}, 
iei(s) 

the set 

M = [J {x e Q : gj{x) = for each j G supp A}, 
xeA 

is a locally minimal identifiable set (relative to Q) at x for v. 

Proof This follows immediately from Proposition 15.11 and Example 14.71 

We end the section by observing that, in particular, the chain rule, estab- 
lished in Proposition 15. 11 allows us to consider the rich class of fully amenable 
functions, introduced in |25j . 
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Definition 5.7 (Fully amenable functions) 

A function / : R" — > R is fully amenable at x if / is finite at 5, and there is 
an open neighborhood U of x on which / can be represented as / — g o F for 
a C 2 -smooth mapping F : V — > R m and a convex, piecewise linear-quadratic 
function g : R m — > R, and such that the qualification condition 

keiVF(x)* ndg°°(F(x)) = {0}, 

holds. 

The qualification condition endows the class of fully amenable functions 
with exact calculus rules. Such functions are indispensable in nonsmooth sec- 
ond order theory. For more details, see [25] . 

Proposition 5.8 (Identifiable sets for fully amenable functions) 

A function f : R n — » R that is fully amenable at a point x £ R" admits a 
locally minimal identifiable set at x for any vector v £ df(x). 

Proof This follows immediately from Propositions 14.61 and 15.11 
6 Variational geometry of identifiable sets 

In the previous sections, we have introduced the notions of identifiability, an- 
alyzed when locally minimal identifiable sets exist, developed calculus rules, 
and provided important examples. In this section, we consider the interplay 
between variational geometry of a set Q and its identifiable subsets M. Con- 
sidering sets rather than functions has the advantage of making our arguments 
entirely geometric. We begin with the simple observation that locally minimal 
identifiable sets are locally closed. 

Proposition 6.1 Consider a closed set Q C R n and a subset M C Q that is 
a locally minimal identifiable set at x for v € Nq(JE). Then M is locally closed 
at x 

Proof Suppose not. Then there exists a sequence € (bd M)\M with Xi — > x. 
Since M is identifiable at x for v, there exists a neighborhood V of v satisfying 
c\V n Nq(xi) = for all large indices i. Observe that for each index i, every 
point y sufficiently close to Xi satisfies V (1 Nq(ij) = 0. Consequently, there 
exists a sequence yt G Q converging to x with VCiNQ(y) — 0, which contradicts 
the necessity of M at x for v. 

Recall that for a set Q C R n and a subset M C Q, the inclusion Nq(x) C 
Nm(x) holds for each point x £ M, while the analogous inclusion for the 
limiting normal cone may fail. This pathology does not occur for identifiable 
sets. 

Proposition 6.2 Consider a closed set Q and a set M that is identifiable at 
x for v £ Nq (x) . Then the equation 

gphA^Q C gphiVjv/ holds locally around (x,v). 
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Proof Consider a sequence G gphiVQ converging to {x,v). Then for 

each i, there exists a sequence (2^,1^) 6 gphNQ converging to {xi,Vi). For 
sufficiently large indices i, the points x\ lie in M for all large j. For such indices 
we have v\ S Nq(xI) C Nm{xI), and consequently <E NM(xi). This verifies 
the inclusion gphiVQ C gphA^M locally around (5,^). 

It turns out that in order to make further headway in studying properties 
of identifiable sets, one must impose the condition of prox-regularity. This 
concept has been discovered and rediscovered by various authors, notably by 
Federer [T2] and Rockafellar-Poliquin We follow the development of [2"6] 
and EZ]. 

Definition 6.3 (Prox-regularity) We say that a set M C R™ is prox- 
regular at x £ M if it is locally closed around x and there exists a neighborhood 
U of cc, such that the projection map Pm is single- valued on U. 

Prox-regularity unifies the notions of convex sets and C 2 -manifolds. A 
proof may be found in 30, Example 13.30, Proposition 13.32]. 

Theorem 6.4 (Prox-regularity under convexity and smoothness) 

Convex sets and C 2 -manifolds are prox-regular at each of their points. 

It will be useful to consider a variant of prox-regularity where we consider 
localization with respect to direction [2HI Definition 2.10]. 

Definition 6.5 (Directional prox-regularity for sets) 

Consider a set Q C R", a point x e Q, and a normal vector v G Nq(x). We 
say that Q is prox-regular at x for v if Q is locally closed at x and there exist 
real numbers e > and r > such that the implication 



x 

holds. 



xeQ, veN Q {x)\ (r + r-^A 

-x\<e,\v-v\<ef ^ FQnB ^ (X + T V > 



Observe that if the implication above holds for some r, e > 0, then it also 
holds for any r' , e' > with r 1 > r and e' < e. 

In particular, a set Q is prox-regular at x, in the sense of Definition 16.31 if 
and only of Q is prox-regular at x for every vector v £ Nq(x). Clearly, if Q is 
prox-regular at x for v, then the equalities 

gph Nq = gph Nq — gph Nq hold locally around (5, v). 

The following characterization Corollary 3.4] will be of some use for 

us. 

Proposition 6.6 (Prox-regularity and monotonicity) 

For a set Q C R™ and x 6 Q, with Q locally closed at x, the following are 
equivalent. 

1. Q is prox-regular at x for v. 



21 



D. Drusvyatskiy, A. S. Lewis 



2. The vector v is a proximal normal to Q at x, and there exists a real number 
r > satisfying 

(vi - w , xi - xo) > -r\xi - x \ 2 , 
for any pairs (xi, Vi) G gphiVQ (for i = 0, 1) near {x, v). 

So Q is prox- regular at x for a proximal normal v G Nq(x) as long as Nq + rl 
has a monotone localization around (x, v+rx), for some real number r > 0. We 
may talk about prox-regularity of functions by means of epigraphical geometry 
[HI Theorem 3.5]. 

Definition 6.7 (Directional prox-regularity for functions) 

We say that a function /: R" — >• R is prox-regular at x for v € df(x) if the 
epigraph epi/ is prox-regular at (x, f{x)) for the vector (u, — 1). 

The following proposition shows that prox-regularity of an identifiable sub- 
set M of a set Q implies that Q itself is prox-regular. 

Proposition 6.8 (Prox-regularity of identifiable sets) 

Consider a closed set Q and a subset M C Q that is identifiable at x for 
v G Nq(x). In addition, suppose that M is prox-regular at x for v. Then Q is 
prox-regular at x for v. 



Proof To show that Q is prox-regular at x for v, we will utilize Proposition ^. 6 
To this end, we first claim that the inclusion v G Nq(x) holds. To see this 
choose a sequence of real numbers fj — > oo and a sequence of points 



Xi G Pq (x - 



Ts 1 v) 



We have 

Vi := n{x — Xi) + v G Nq{xi). 

Clearly Xi — > x. We now claim that the sequence Vi converges to v. Observe 
by definition of Xi, we have 

\{x — Xi) + r~ 1: v\ < \r^ v\. 
Squaring and cancelling terms, we obtain 

2{v, x - x^ < —n\x - x t \ 2 . 
Combining this with the inclusion v G Nq(x), we deduce 

0\X — Xi) , X — Xi , ._ I 

-p^ — r < 2(v, -p— — | ) < -n\x - Xi\ 

\X Xi \X Xi\ 

We conclude (x — x t ) — > and consequently v t — >■ v. Since M is identifiable 
at x for v, we deduce i, G M for all large indices i. In addition, since M is 
prox-regular at x for we have 

Xi = PMnB c (s)(x + rl x v) = x, 
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for some e > and for sufficiently large indices i. Hence the inclusion v S 
N£(x) holds. 

Now since M is prox-regular at x for v we deduce, using Proposition 16.61 
that there exists a real number r > satisfying 



(vi - v Q , xi - x ) > -r\xi - x 



2 



for any pairs (xi,Vi) £ gphNM (for i = 0, 1) near (x, v). 
By Proposition 16.21 we have 

gphiVQ C gphiVjv/ locally around (x,v). 

Recalling that v is a proximal normal to Q at x and again appealing to Propo- 
sition we deduce that Q is prox-regular at x for v. 



Remark 6.9 In Proposition 16.81 we assumed that the vector v is a Frechet 
normal. Without this assumption, the analogous result fails. For instance, 
consider the sets 

Q:={(x,y)eR 2 :xy = 0}, 

M:={(x,y) eR 2 :y = 0}. 

Then clearly M is identifiable at x := (0, 0) for the normal vector v :— (0, 1) 6 
Nq(x). However, v is not a proximal normal. 

The following result brings to the fore the insight one obtains by combining 
the notions of identifiability and prox-regularity. It asserts that given a prox- 
regular identifiable set M at x for v S Nq(x), not only does the inclusion 
gphiVQ C gphTVjv/ hold locally around (x,v), but rather the two sets gphA^Q 
and gphA^A/ coincide around (x,v). 

Proposition 6.10 (Reduction I) Consider a closed set Q and let M C Q 

be a set that is prox-regular at a point x, for v € Nq (x) . Then M is identifiable 
at x for v if and only if 

gphNQ = gphiVjv/ locally around (x,v). 

Proof We must show that locally around (x,v), we have the equivalence 

gph7V Q C Mx R" gph N Q = gphN M . 

The implication "<=" is clear. Now assume gphA^Q C M x R" locally around 
(x, v). By prox-regularity, there exist real numbers r, e > so that Pq(x + 
r _1 u) = x (Proposition [6?8]) and so that the implication 

xeM, v€N M (x)\ ( x + r- l v)-x 

\x - x\ < e, \v - v\ < e J ^ ^™*^ x + r v > ~ x ' 

holds. By Proposition 16.21 it is sufficient to argue that the inclusion 

gphAOv/ C gphiVQ holds locally around (x,v). 
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Suppose this is not the case. Then there exists a sequence (xi,Vi) — > (x,v), 
with (xi,Vi) G gphN M and (xi,Vi) g gphN Q . Let Zi G Pq{x 1 +r' 1 v i ). We 
have 

(xi - z i )+r~ 1 v l G Ng(zi), 

Xi ^ Zi. (6.1) 

Observe — » x by the continuity of the projection map. Consequently, by the 
finite identification property, for large indices i, we have z, G M and 

.T, + r _1 Wi e Zi + Ng(zi) Cz, + N M (zi). 

Hence z% — PunB^^ixi + r~ 1 Vi) — Xi, for large i, thus contradicting (|6.1I) . 

Recall that Proposition 16.81 shows that prox-regularity of an identifiable 
subset M C Q is inherited by Q. It is then natural to consider to what extent 
the converse holds. It clearly cannot hold in full generality, since identifiable 
sets may contain many extraneous pieces. However we will see shortly that the 
converse does hold for a large class of identifiable sets M, and in particular 
for ones that are locally minimal. The key tool is the following lemma, which 
may be of independent interest. 

Lemma 6.11 (Accessibility) Consider a closed set Q C R" and a subset 
M C Q containing a point x. Suppose that for some vector v G Nq(x), there 
exists a sequence 

Vi G N&(x) \ N^(x) with Vl -> v. 

Then there exists a sequence (xi, Vi) E gph Nq converging to (x, v) with Xi ^ M 
for each index i. 

Proof For each index i, there exists a real r\ > satisfying Pm(x + yi) — 
{x}. Furthermore we can clearly assume — > oo. Define a sequence {x%,Vi) G 
gphiVg by 

Xi G Pq(x + r^ 1 yi) and v t :— r t (x - Xi) + y t . 

Observe x% ^ M since otherwise we would have x% = x and yi = riVi G Nq(x), 
a contradiction. By continuity of the projection Pq, clearly we have Xi — > x. 
Now observe 

\(x - x.^ + rl l yi\ < r^ l \yi\. 
Squaring and simplifying we obtain 

n\x- Xi\+ 2( < 0. 

Since v is a Frechet normal, we deduce 

liminf ( — - n Vi) = liminf (jz — ^-r,v) > 0. 

i->oo \\X — Xi\ I i^oo \\X — Xi\ I 

Consequently we obtain ri\x — Xi\ — > and Vi —¥ v, as claimed. 
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Proposition 6.12 (Reduction II) Consider a closed set Q C R™, a point 
x, and a normal v £ Nq(x). Suppose gphTVg = gph Nq locally around (x,v), 
and consider a set M :— Nq 1 ^), where V is a convex, open neighborhood of 
v. Then the equation 

gphA^Q = gph Nm holds locally around (x,v). 

Proof First observe that since M is identifiable at x for v, applying Propo- 
sition 16.21 we deduce that the inclusion gphA^Q C gphTVM holds locally 
around (x, v). To see the reverse inclusion, suppose that there exists a pair 
(x,v) £ gphA^, arbitrarily close to (x,v), with v £ V and v ^ Nq(x). By 
definition of M, we have Nq(x) DV^$. Let z be a vector in this intersec- 
tion, and consider the line segment 7 joining z and v. Clearly the inclusion 
7 C V n N^j(x) holds. Observe that the line segment 7 n Nq(x) is strictly 
contained in 7, with z being one of its endpoints. Let w be the other endpoint 
of 7 n Nq (x) . Then we immediately deduce that there exists a sequence 

y, £ N^(x) \ Nq(x) with y, -> w. 

Applying Lemma 16.111 we obtain a contradiction. Therefore the inclusion 
gphiVQ D gphTV^ holds locally around (x,v). Taking the closure the result 
follows. 

In particular, we obtain the following essential converse of Proposition 16. 81 

Proposition 6.13 (Prox-regularity under local minimality) 

Consider a closed set Q and a subset M C Q that is a locally minimal identi- 
fiable set at x for v £ Nq(x). Then Q is prox-regular at x for v if and only if 
M is prox-regular at x for v. 

Proof The implication was proven in Proposition 16.81 To see the reverse 
implication, first recall that M is locally closed by Proposition 16.11 Further- 
more Propositions 13.131 shows that there exists an open convex neighborhood 
V of v so that M coincides locally with N^iV). In turn, applying Proposition 
16. 121 we deduce that the equation 

gphTVQ = gphA^Af holds locally around (x,v). 

Finally by Proposition l6.6l prox-regularity of Q at x for v immediately implies 
that M is prox-regular at x for v. 

We end this section by exploring the strong relationship between identifi- 
able sets and the metric projection map. We begin with the following propo- 
sition. 

Proposition 6.14 (Identifiability and the metric projection) 

Consider a closed set Q and a subset M C Q. Let x £ M and v £ Nq(x). 
Consider the following conditions. 
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1. M is identifiable (relative to Q) at x for v. 

2. For all sufficiently small A > 0, the set M is identifiable (relative to Pq 1 ) 
at x for x + Xv . 

Then the imvlication IT1 =>H?1 holds. If in addition Q is prox-regular at x for v, 
then the eauivalence\T\<Fp2\ holds. 

Proof [T1=^nil Recall that for all small A > 0, we have Pq(x + Xv) = x. Fix such 
a real number A and consider a sequence (xi,yi) — > (x,x + Xv) in gph Pq 1 - 
Observe Xi G -Pq(j/j) and the sequence X~ 1 (yi — Xi) € Nq[x^ converges to v. 
Consequently, the points Xi all eventually lie in M. 
Suppose now that Q is prox-regular at x for v. 

r21^m We may choose A, e > as in Definition 16.51 and satisfying Pq(x + 
Xv) — x. Consider a sequence (xi,vt) —> (x,v) in gphiVQ. Then the sequence 
(xi,Xi + Xvi) converges to (x,x + Xv) and lies in gphPg 1 for all sufficiently 
large i. Consequently, we have Xi G M for all large i. 

Assuming prox-regularity, a simple way to generate identifiable subsets 
M G Q is by projecting open sets onto Q. 

Proposition 6.15 (Projections of neighborhoods are identifiable) 

Consider a set Q C R n that is prox-regular at x for v G Nq(x). Then for all 
sufficiently small X > 0, if the inclusion i + AC£ int U holds for some set U , 
then Pq(U) is identifiable at x for v. 

Proof Suppose (a;,-,i>j) — ¥ (x,v) in gphiVg. Then by prox-regularity for all 
sufficiently small A > 0, we have Pq(xi + Xvi) — Xi and Xi + Xvi G U for all 
large i. We deduce Xi G Pq{U) for all large i, as we needed to show. 

In fact, we will see shortly that under the prox-regularity assumption, all 
locally minimal identifiable sets arise in this way. 

Lemma 6.16 Consider a closed set Q C R.™ and a subset M that is identifi- 
able at x for v G Nq(x). Then for all sufficiently small A > and all e > 0, 
the inclusion 

x + Xv € int {x + Xv : x G M, v G Nq(x), \x — x\ < e, \v — u| < e}, 

holds. 

Proof For all sufficiently small A > 0, we have Pq(x + Xv) — x. Consider a 
sequence Zi — > x + Xv and choose points 

Xi G Pq{zi). 

Observe x^ — > x and the vectors X~ 1 (zi — x-i) G Nq(xi) converge to v. Conse- 
quently, the points Xi lie in M for all large i, and hence = Xi + X(X^ 1 (zi~ Xi)) 
lie in the desired set eventually. 
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Proposition 6.17 (Representing locally minimal identifiable sets) 

Consider a set Q C R ra that is prox-regular at x for v G Nq(x) and let M be 
a locally minimal identifiable set at x for v. For A, e > 0, define 



Then for all sufficiently small A, e > 0, we have x + Xv G int U and M admits 
the presentation 



Proof Using prox-regularity and Lemma 16.111 we deduce that for all suffi- 
ciently small A, e > we have 



and x + Xv G int U. Using the fact that M is locally minimal at x for v, it is 
easy to verify that M and Pq(U) coincide locally around x. 

Finally, we record the following characterization of identifiable sets. 

Proposition 6.18 (Characterization of identifiable sets) 

Consider a closed set Q C R n } a subset M C Q, a point x G M , and a normal 
vector v £ Nq(x). Consider the properties: 

1. M is identifiable at x for v. 

2. Q is prox-regular at x for v and for all sufficiently small A > and all 
e > the inclusion 

x + Xv G int {x + Xv : x G M, v G Nq{x), \x — x\ < e, \v — v\ < e}, 



Then the implication © (HJ holds. If M is prox-regular at x for v, then we 
have the equivalence (jTJ) <^=> (J2]) . 

Proof ([2) =>■ (HJ): By Proposition 16. 15[ for all sufficiently small A > and all 

e > 0, the set 



is identifiable at x for v. Furthermore, for all sufficiently small A, e > this set 
is contained in M locally around x. Consequently M is identifiable at x for v. 
Now suppose that M is prox-regular at x for v. 

© =^ © : This follows trivially from Proposition ^. 81 and Lemma [6.161 



U := {x + Xv : x e M,v G Nq(x), \x — x\< e, \v - v\ < e}. 



M = Pq(U) locally around x. 




holds. 



Pq({x + Xv : x G M,v G N Q (x), \x — x\ < e, \v - v\ < e} 



30 



D. Drusvyatskiy, A. S. Lewis 



7 Identifiable sets and critical cones 

In this section, we consider critical cones, a notion that has been instrumental 
in sensitivity analysis, particularly in connection with polyhedral variational 
inequalities. See Section 2E] for example. We will see that there is a strong 
relationship between these objects and locally minimal identifiable sets. We 
begin with the notion of tangency. 

Definition 7.1 (Tangent cones) Consider a set Q C R™ and a point ieQ. 
The tangent cone to Q at x, written Tq(x), consists of all vectors w such that 

w = lim — , for some Xi Q x, r% \- 0. 

i— ► oo T i 

The tangent cone is always closed but may easily fail to be convex. For any 
cone K 6 R", we consider the polar cone 

K* := {y : (y, v) < for all v £ K}. 

It turns out that the sets clconvTg(a;) and Nq(x) are dual to each other, that 
is the equation 

N Q (x) = T Q (x)\ 

holds [30 :i Theorem 6.28]. Consequently if Q is locally closed at x, then Q is 
Clarke regular at x if and only if the equation Nq(x) = Tq(x)* holds. 
A companion notion to tangency is smooth derivability. 

Definition 7.2 (smooth derivability) Consider a set Q and a point x G Q. 
Then a tangent vector w 6 Tq{x) is smoothly derivable if there exists a C 1 - 
smooth path 7: [0, e) — > Q satisfying 

w = lim , 

44.0 t 

where e > is a real number and 7(0) = x. We will say that Q is smoothly 
derivable at x if every tangent vector w S Tq(x) is smoothly derivable. 

We should note that there is a related weaker notion of geometric deriv- 
ability, where the path 7 is not required to be C 1 -smooth. For more details 
see [301 Definition 6.1]. 

Most sets that occur in practice are smoothly derivable. In particular, any 
smooth manifold is smoothly derivable at each of its point, as is any semi- 
algebraic set Q C R n . We omit the proof of the latter claim, since it is a 
straightforward consequence of the curve selection lemma [311 Property 4.6] 
and the details needed for the argument would take us far off field. For a nice 
survey on semi-algebraic geometry, see [8]. 

We now arrive at the following central notion. 

Definition 7.3 (Critical cones) For a set Q C R" that is Clarke regular at 
a point x € Q, the critical cone to Q at x for v S Nq(x) is the set 

K Q (x,v) := N Nq{s) (v). 



Optimality, identifiability, and sensitivity 



31 



Because of the polarity relationship between normals and tangents, the 
critical cone Kq(x,v) can be equivalently described as 

K Q (x,v)=T Q (x)nv ± , 

where v 1 - is the subspace perpendicular to v. For more information about 
critical cones and their use in variational inequalities and complementarity 
problems, see [TT] . 

Connecting the classical theory of critical cones to our current work, we 
will now see that critical cones provide tangential approximations to locally 
minimal identifiable sets. In what follows, we denote the closed convex hull of 
any set Q C R™ by coQ. 

Proposition 7.4 (Critical cones as tangential approximations) 

Consider a set Q that is Clarke regular at a point x and a locally minimal 
identifiable set M at x for v £ Nq (x) . Suppose furthermore that M is prox- 
regular at x for v and is smoothly derivable at x. Then the equation 

coT M {x) = K Q (x,v), 

holds. 

Proof Observe 

K Q (x,v) = N Nq{S:) (v) = N Nm{s) (v) = N^ Mis) (v) = coTm(x) n v 1 -, 

where the second equality follows from Proposition 16.101 and the last equality 
follows from polarity of cl conv Tm (x) and Nm (x) . Hence to establish the claim, 
it is sufficient to argue that every tangent vector w G Tm{x) is orthogonal to 
v. 

To this end, fix a vector w £ Tm{x) and a C 1 -smooth path 7: [0, e) — » Q 
satisfying 

l{t)-x 

w = urn , 

tj.o t 

where e > is a real number and 7(0) = x. 

Let U £ (0, e) be a sequence converging to and define x% := j(U). Observe 
that for each index i, the tangent cone TmC^i) contains the line {XjiU) : A £ 
R}. Since M is necessary at x for v, there exist vectors Vi £ NQ(j(ti)) with 
Vi —s- v. By Proposition ^. 1Q[ we have v\ £ Nm( , j(U)) for all large i. For such 
indices, we have (fi,7(ij)) = 0. Letting i tend to 00, we deduce (v,w) = 0, as 
we needed to show. 

Classically, the main use of critical cones has been in studying polyhe- 
dral variational inequalities. Their usefulness in that regard is due to Propo- 
sition 17.51 stated below. We provide a simple proof of this proposition that 
makes it evident that this result is simply a special case of Proposition 16.101 
This further reinforces the theory developed in our current work. For an ear- 
lier proof that utilizes representations of polyhedral sets, see for example [^1 
Lemma 2E.41. 
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Proposition 7.5 (Polyhedral reduction) Consider a polyhedron Q C R n 
and a normal vector v £ Nq(x), for some point x £ Q. Let K := Kq(x,v). 
Then we have 

gph Nq — (x,v) — gphiVft- locally around (0,0). 

Proof By Example 14.41 the set M := argmax a . e g(a;, v) is the locally minimal 
identifiable set at x for v. Being polyhedral, M is smoothly derivable and it 
satisfies 

x + Tm{x) = M locally around x. 

In light of Proposition 17.41 we deduce M — x = K locally around 0. 
Thus for all (u, w) sufficiently near (0, 0) we have 

V + U £ Nq (x + w) <^=> V + u £ Nm (x + w) 
<^=> V + u £ Nk (w) 
<^==> u £ Nk(w) 

where the first equivalence follows from Proposition l6.10l and the last equiva- 
lence follows from the fact that K C v and so for all w £ K, the cone Nk(w) 
contains the line spanned by v. 

Proposition l7.5l easilv fails for nonpolyhedral sets. Indeed, in light of Propo- 
sition 17.41 this is to be expected since critical cones provide only tangential 
approximations to locally minimal identifiable sets. Such an approximation is 
exact only for polyhedral sets. Hence the theory of locally minimal identifi- 
able sets (in particular, Proposition 16. 101) extends Proposition [73] far beyond 
polyhedrality. 

We end this section by showing how Proposition 17.41 can be extended even 
further to the situation when locally minimal identifiable sets do not even 
exist. Indeed, consider a set Q that is Clarke regular at a point x, and let v £ 
Nq(x). Consider a nested sequence of open neighborhoods Vi of v satisfying 
Hi^i Vi = {^}- One would then expect that, under reasonable conditions, the 
equality 

oo 

K Q (x,v) =co f]T N -i (Vi) (x), 
»=1 

holds. To put this in perspective, observe that if there exists a locally minimal 
identifiable set M at x for v, then the sets T N -i^ V j(x) are equal to Tm{x) for 

all large i, and the equation above reduces to Proposition 17.41 More generally, 
the following is true. 

Proposition 7.6 (Critical cones more generally) Consider a set Q that 
is Clarke regular at a point x, and let v £ Nq(x). Consider a nested sequence 
of open neighborhoods Vi of v satisfying f\^ =1 Vi — {v} and the corresponding 
preimages Mi := Nq 1 ^). Assume that each Mi is smoothly derivable at x. 
Then the inclusion 

oo 

K Q (x,v)Dcof]T M M, (7-1) 
i=i 
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holds. Assume in addition that each Mi is prox-regular at x for v and that the 
formula 

oo oo 

co P| T Mi (x) = p| coT m ,(x), (7.2) 

i=l i=l 

holds. Then each Mi is an identifiable set at x for v and we have 

oo 

K Q {x,v)=cof]T AU {x). 

i=l 

We omit the proof of the proposition above since it follows along the same 
lines as the proof of Proposition 17.41 In particular, let us note that (|7.2[) holds 
whenever the tangent spaces Tm { (i) all coincide for sufficiently large indices i 
or whenever all Mi are Clarke regular at x. 



8 Optimality conditions 

In this section, we will see that the order of growth of a function / around a 
critical point (a point satisfying € df(x)) is dictated entirely by its order 
of growth around this point on a corresponding identifiable set. Here is a 
preliminary geometric result. 

Proposition 8.1 (Restricted optimality) Consider a closed set Q and a 
subset M C Q that is identifiable at x for a proximal normal v S Nq(x). Then 
x is a (strict) local maximizer of the linear function (v, ■) on M if and only if 
x is a (strict) local maximizer of (v, ■) on Q. 

Proof One implication is clear. To establish the converse, suppose that x is 
a local maximizer of the linear function (v, •) on M. We will show that the 
inequality, (v,x) > (v,x), holds for all points x 6 Q\M near x. Indeed, 
suppose this is not the case. Then there exists a sequence Xi — > x in Q \ M 
satisfying 

(v,x) < (v,xt). (8.1) 

Since v is a proximal normal, we deduce that there exists a real number r > 
satisfying Pq(x + r _1 w) = {x}. Consider any points Zi with 

Zi S Pq{xi + r^v). 

Clearly we have Zi — > x and 

(xi - z^ + r~ x v 6 N Q (zi). 

Since M is identifiable at x for v, we deduce Zi £ M for all large indices i. 
Consequently, for such indices i, we have Xi ^ Zi. 
Observe 

\(xi — Zi) + r^ x v\ < r" 1 jf;! . 
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Squaring and canceling terms, we obtain 



(v,Zi - Xi) > -\zi - Xi\ 2 . (8.2) 



Consequently, 



r 



\zi - Xi\ 2 <(v,x- Xi) < 0, 



which is a contradiction. Claim (1) now follows. 

Recall that a function / : R" — 5- R is said to grow quadratically around x 
provided that the inequality 

itotof IM ip > o, 

x^-x \x — X\ Z 

holds. We now arrive at the main result of this section. 

Proposition 8.2 (Order of growth) Consider a function f : R™ — > R and 
a set M C R™. Suppose that M is identifiable at x for v = £ dpf(x). Then 
the following are true. 

1. x is a (strict) local minimizer of f restricted to M ^ x is a (strict) local 
minimizer of the unrestricted function f. 

2. More generally, consider a growth function g: U — > R, defined on an open 
neighborhood U ofO, that is C 1 -smooth and satisfies 

f(x) < f(x) — g{x — x) for all x G M near x, 
.9(0) = 0, V ff (0) = 0, 

Then the above inequality, in fact, holds for all points x £ R n near x. 
In particular, the function f , restricted to M , grows quadratically near x 
if and only if the unrestricted function f grows quadratically near x. 

Proof We first prove claim (1). By Proposition 13.161 gph/|„ is identifiable, 
with respect to epi/, at (x, f(x)) for (0, —1). Now observe that a; is a (strict) 
local minimizer of f\ M if and only of (x,f(x)) is a (strict) local maximizer 
of the linear function, (x,r) i— > —r, on gph/| M . Similarly x is a (strict) local 
minimizer of / if and only of (x,f(x)) is a (strict) local maximizer of the 
linear function, (x,r) i— > — r, on epi/. Combining these equivalences with 
Proposition 18.11 establishes the claim. 

We now prove claim (2). Suppose that the growth condition is satisfied. 
Let h := f — g[x — x). Since / is C 1 -smooth, 17(0) = 0, and Vg(0) = 0, it easily 
follows that M is identifiable, now with respect to h, at x for £ dph(x). 
Furthermore, the point a; is a strict local minimizer of h\ M . Applying claim 
(1) of the current proposition, we deduce that a; is a strict local minimizer of 
the unrestricted function h, that is 

f(x) = h(x) < h(x) — f(x) — g(x — x), for all x near x, 



as we needed to show. 
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In particular, we obtain the following curious characterization of quadratic 
growth. 

Corollary 8.3 (Refined optimality) Consider a function f: R" — > R and 
a point x with G dpf(x). Then f grows quadratically around x if and only if 

liminf ~ f i x) > 0. (8.3) 

(xj(x),v)^(x,f(x),0) \X-X\ 2 
V£df(x) 

Proof Clearly if / grows quadratically around x, then (|8.3[) holds. Conversely, 
assume (|8.3p holds and let Vi be a sequence of neighborhoods of shrinking 
to and let ti > be real number tending to 0. Then the sets 

Mi := (df)-\V t ) f){xe R" : \f(x) - f(x)\ < ej, 

are identifiable at x for 0. Furthermore, / restricted to Mi must grow quadrat- 
ically around x, for all sufficiently large indices i, since the alternative would 
contradict (|8 . 3[) . Applying Proposition [521 we obtain the result. 



9 Identifiable Manifolds 

Consider a closed set Q and a normal vector v S Nq(x), for a point x G Q. 
The inherent difficulty in analyzing properties of the optimization problem, 

P(v) : max (v, x), 
s.t. x € Q, 

such as dependence of the local maximizers of P(v ) on v or the order of growth 
of the function x H ► (x, v) on Q near x, stem entirely from the potential 
nonsmoothness of Q. However, as we have seen in Proposition 16.101 the local 
geometry of gph Nq is entirely the same as that of a prox-regular identifiable 
set M at x for v. Thus, for instance, existence of an identifiable manifold M 
at x for v shows that the nonsmoothness of Q is not intrinsic to the problem 
at all. Our goal in this section is to investigate this setting. We begin with the 
following easy consequence of Proposition ^. 101 

Proposition 9.1 Consider a closed set Q C R" and suppose that a subset 
M C Q is a C 2 identifiable manifold at x for v € Nq (x) . Then the following 
properties hold. 

1. v lies in the interior of the cone Nq(x), relative to its linear span Nm(x). 

2. There exists an open neighborhood U of x and V of v such that the mapping 
x i V V M Nq(x), restricted to M , is inner- semicontinuous at each x 6 

unM. 
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Proof To see the validity of the first claim, observe that if it did not hold, then 
we could choose a sequence of vectors Vi satisfying 

Vi ->V, Vi £ N M (x), Vi 4- Nq(x), 

thus contradicting Proposition 16. 101 

The second claim now easily follows from Proposition 16.101 

Consider a locally minimal identifiable subset M C Q at x for v G Nq(x). 
Then M remains identifiable at x for v G Nq(x), whenever the pair (x,v) is 
sufficiently close to (x,v). However under such perturbations, M might cease 
to be locally minimal, as one can see even from polyhedral examples. (Indeed 
when Q is a convex polyhedron, this instability occurs whenever the inclusion 
v G rb N Q (x) holds.) 

In the case of identifiable manifolds, the situation simplifies. Identifiable 
manifolds at x for v G Nq(x) are automatically locally minimal, and fur- 
thermore they remain locally minimal under small perturbations to (x,v) in 
gphiVQ. 

This important observation is summarized below. 

Proposition 9.2 Consider a closed set Q and a C 2 identifiable manifold M C 
Q at x for v G Nq(x). Then M is automatically a locally minimal identifiable 
set at x G M for v G Nq(x) whenever the pair (x,v) is near (x,v). 

Proof This follows directly from Proposition 12 .41 and Proposition l9.il 

In particular, identifiable manifolds at x, for v G df(x) are locally unique. 

9.1 Relation to Partial Smoothness 

In this section, we will relate identifiable manifolds to the notion of partial 
smoothness, introduced in j!9) . The motivation behind partial smoothness 
is two-fold. On one hand, it is an attempt to model an intuitive idea of a 
"stable active set" . On the other hand, partial smoothness, along with certain 
nondegeneracy and growth conditions, provides checkable sufficient conditions 
for optimization problems to possess good sensitivity properties. Evidently, 
partial smoothness imposes conditions that are unnecessarily strong. We now 
describe a variant of partial smoothness that is localized in a directional sense. 
This subtle distinction, however, will be important for us. 

Definition 9.3 (Directional Partial Smoothness) Consider a closed set 
Q C R™ and a C 2 -manifold M C Q. Then Q is partly smooth with respect to 
M at x & M for v G N Q (x) if 

1. (prox-regularity) Q is prox-regular at x for v. 

2. (sharpness) spanA^x) = Nm{x). 

3. (continuity) There exists a neighborhood V of v, such that the mapping, 
x I— > V H Nq(x), when restricted to M, is inner-semicontinuous at x. 
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We arrive at the main result of this subsection. 

Proposition 9.4 (Identifiable manifolds and partial smoothness) 

Consider a closed set Q C R n and a subset M C Q that is a C 2 manifold 
around a point x € Q. Let v € Nq(x). Then the following are equivalent. 

1. M is an identifiable manifold at x for v. 

2. We have 

gph Nq — gphiVft/ locally around (x,v). 

3. — Q is partly smooth with respect to M at x for v. 
— the strong inclusion v G tiNq(x) holds. 

4- The set Q is prox-regular at x for v, and for all sufficiently small real 
numbers A, e > 0, the inclusion 

x + Aueint( (J (x + N Q (x)) 

x£MnB„(x) 

holds. 

Proof The equivalence (TTJ) <^> © has been established in Proposition 16.101 
The implication (HJ => © follows trivially from Propositions 16.81 and 19.11 
© © : There exist real numbers r, e > so that the implication 

\x - x\ < e, \v - v\ < e } ^ ^w* x + r v) - x, 

holds. 

For the sake of contradiction, suppose 

x + r -1 u e bd ( |J (x + N Q (x))y 

x£MnB e (x) 

Then there exists a sequence of points Zi — > x + r~ x v with 

Zii |J (x + N Q (x)), 
xeMnB € (x) 

for each index i. Let xi € Pm{zi). Observe 

Xi ->• x, Zi - Xi € N M (xi). 

Clearly, 

Zi - Xi £ Ngtxi) C N M {xi), 

for large indices i. Hence, there exist separating vectors a; £ Nn,[(xi) with 
| a, | = 1 satisfying 

SUp (a,i,v) < ((li,Zi - Xi) = (Oi,Zi -x) + (a 2 ,x - Xi). 

veNgixi) 



38 



D. Drusvyatskiy, A. S. Lewis 



We deduce, 

sup ((H,v) < (ai,r(zi - x)) + (a;,r(x - £;)). 

veNg(xi) 

Passing to a subsequence, we may assume a. L — > a for some nonzero vector 
a e Nm{x). Observe v + 8a 6 Ncj(xi) for all small 6 > 0. Consequently for 
all sufficiently small S > 0, there exist vectors G NQ^Xi) with I), — > u + 5a. 
Observe 

(a,, Uj) < (a,, r(z l - x)) + (a,-, r(i - a;,)}. 
Letting i tend to oo, we obtain 

(a, v + Sa) < (a, v), 

which is a contradiction. 

© © : Choose r, e > so as to ensure 

x + r^v G int ( |J (a; + JVq (a;)) j . 

i£MnB,(i) 

Consider any sequence of points Xi 6 R™ and vectors Vi G Ng(xi), with — >• a; 
and «i — >■ v. Then for all large indices i, the inclusion 

x l +r~ 1 n l e |J (x + N Q (x)), 

x£MC\B t (x) 

holds. Shrinking r and e, from prox-regularity of Q, we deduce £ M for all 
large indices i. Hence M is identifiable at x for 

Some comments concerning characterization (jl]) of the previous proposition 
are in order. Consider a convex set Q containing a point x, and let v £ Nq(x) 
be a normal vector. Then the arguments © =>• (0} and (HJ) =4> (HJ show that a 
manifold M C Q is identifiable at al for v if and only if the inclusion 

x + v G int ( (J (o; + JVQ(a;))V 

holds. The region UzeM ( x + Nq(x)) is formed by attaching cones Nq(x) 
to each point a; G M. This set is precisely the set of points in R™ whose 
projections onto Q lie in M. Thus a manifold M is identifiable at x for u 
whenever the region \J xeM (x + Nq(x)) is "valley-like" around x + v. We end 
the section with an observation relating identifiable manifolds to critical cones. 

Corollary 9.5 Consider a closed set Q C R™ that is Clarke regular at a point 
iG(J. Suppose that M C Q is a C 2 identifiable manifold at x for v. Then the 
critical cone Kq[x,v) coincides with the tangent space Tm(x). 

Proof This is an immediate consequence of Proposition 17.41 
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9.2 Projections onto identifiable manifolds 

In this subsection, we show that C 2 identifiable manifolds occur precisely as 
the images of C 1 -smooth projections with constant rank. Predecessor results 
of a similar flavour can be found in [T7] and [35]. We will need the following 
classical result. 

Proposition 9.6 Consider a C 2 -smooth manifold M C R" and a point x S 
M. Then there exists a neighborhood U of x so that the projection mapping 
Pm is a C 1 -smooth submersion onU . In particular it sends open subsets ofU 
to open subsets of M. 

Proof Let V be a neighborhood of x so that 

M n V = {x S V : hi(x) = for i = 1, . . . , m}, 

where hi : V —> R are C 2 -smooth defining functions and the gradients Vhi(x) 
are linearly independent. Since M is prox-regular at x, the mapping y n> 
Pm{v) is single- valued near x. Furthermore, the point Pm(d) is precisely the 
point x solving the system 

hi(x) = 0, for i = 1, . . . ,m, 
x + \iVhi(x) = y. 

i=l 

for some A £ R m . Applying the inverse function theorem to the system above 
yields the result. 

Proposition 9.7 (Smooth projections) Consider a closed set Q C R™ 
and a C 2 manifold M C Q containing a point x. Let v £ Nq(x) be a normal 
vector. Then the following are equivalent. 

1. M is an identifiable manifold around x for v. 

2. The set Q is prox-regular at x for v, and for all sufficiently small real 
numbers X and all sufficiently small neighborhoods U of x + Xv, 

— The projection mapping Pq is C 1 -smooth and has constant rank on the 
ball U. 

— The image Pq(U) coincides with M near x. 

Proof ([!]) @ : This follows immediately from Propositions 16.81 10 . 61 and 

Em 

© => dU) : This is immediate from Proposition 16. 151 
10 The case of functions 

Many of the results we have derived in the previous sections were stated in 
terms of sets. Their generalizations to the functional setting can be carried out 
in a standard way by appealing to Proposition 13. 161 In this section, we record 
some of these results for ease of reference in future work. Most of the proofs 
are omitted. 
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Proposition 10.1 (Prox-regularity of identifiable sets) Consider a Isc 
function f : R™ — > R and a set M C R™ that is identifiable at x for v £ df(x). 
In addition, suppose that f + 5m is prox-regular at x for v. Then f is prox- 
regular at x for v. 

In stating the next results, it will be convenient to define "subjets" associ- 
ated to each subdifferential. Namely, for a function / : R™ — > R, we define the 
limiting subjet to be the set 

[df] := {(*, f{x), v) £ R™ x R x R" : v £ df(x)}. 

Subjets corresponding to the other subdiffcrentials are defined analogously. 
These objects are useful for concisely describing variational properties of func- 
tions that are not necessarily subdifferentially continuous. On the other hand 
if / is subdifferentially continuous, as is often the case, then in the results that 
follow we may simply use subdifferential graphs, instead of subjets. 

Proposition 10.2 (Reduction I) Consider a Isc function f : R™ — > R and 
a set M C R™ so that f + 5 m is prox-regular at a point x for v £ df(x). Then 
M is identifiable at x for v if and only if 

[df] = [^(/ + $m)] locally around (x, f(x),v). 

Proposition 10.3 (Reduction II) Consider a Isc function f : R™ — > R, a 
point x £ R™, and a subgradient v £ df(x). Suppose 

[d p '/] = [df] locally around (x, f(x),v), 

and consider a set M := (9/) _1 (V r ) fl {x : \f(x) — f(x)\ < e), where V is a 
convex, open neighborhood of v and e > is a real number. Then the equation 

gph [df] = gph [d(f + 5m)] holds locally around (x, f(x), v). 

Proposition 10.4 (Prox-regularity and local minimality) 

Consider a Isc function f : R n — > R and a set M C R™ that is a locally 
minimal identifiable set at x for v £ df(x). Then f is prox-regular at x for v 
if and only if f + 8m is prox-regular at x for v. 

The following is a standard generalization of critical cones to the functional 
setting. 

Definition 10.5 (Critical cones) For a function /: R" — > R that is Clarke 
regular at a point x, the critical cone of / at x for v £ df(x) is the set 

K f (x,v) := N dm (v). 

Definition 10.6 (Smooth derivability) A function /: R™ — >• R is said to 
be smoothly derivable at a point x £ R" if gph/ is smoothly derivable at 
(xj(x). 
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Proposition 10.7 (Critical cones more generally) Consider a function 
f : R™ — > R that is Clarke regular at a point x, and let v G df(x). Consider a 
nested sequence of open neighborhoods Vi of v satisfying DSi Vi — { v } an d a 
sequence of reals J, 0, and the corresponding preimages Mi :~ df~ 1 (Vi)n{x : 
\f(x) — f(x)\ < e}. Assume that each f + SMi is smoothly derivable at x. Then 
the inclusion 

oo 

K f (x,v) D co f]T Mi (x), (10.1) 

i=l 

holds. Assume in addition that each / + <5m 4 is prox-regular at x for v and that 
the formula 

oo oo 

So p| TmM = f]coT Mi {x), (10-2) 
holds. Then each Mi is an identifiable set at x for v and we have 

oo 

K f (x,v) = co f]T Mi (x). 
i=i 

In particular, the following is true. 

Proposition 10.8 (Critical cones as tangential approximations) 

Consider a function f : R" — > R that is Clarke regular at a point x, and let M 
be a locally minimal identifiable set at x for v G df(x). Suppose furthermore 
that f + Sm is prox-regular at x for v and is smoothly derivable at x. Then the 
equation 

coT M {x) = K f (x,v), 

holds. 

Definition 10.9 (Identifiable manifolds) Given a function /: R™ — > R, a 
set M C R™ is a C p identifiable manifold at i for C e df(x) provided that the 
following hold. 

— M is a C p manifold around x and the restriction /| is C p -smooth near 
x. 

— M is identifiable at x for v. 

Proposition 10.10 Consider a Isc function f : R" — > R and a C 2 identifi- 
able manifold M C dom/ at x for v G df(x). Then M is automatically a 
locally minimal identifiable set at x £ M for v G Nq(x), whenever the triple 
(x,f(x),v) is near (x 7 f(x) 7 v). 

A functional version of partial smoothness is as follows. For a convex sets 
K C R™, we let par K be parallel subspace to K, that is par K = span {K — x} 7 
where x is any point of K. 

Definition 10.11 (Directional partial smoothness) Consider a lsc func- 
tion /: R" — > R and a C 2 -manifold M C dom/. Then / is partly smooth with 
respect to M at x G M for v G df(x) if 
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1. (prox-regularity) / is prox-regular at x for v. 

2. (smoothness) The restriction f\ M is C 2 smooth around x. 

3. (sharpness) par<9/ = Nm(x). 

4. (continuity) There exists a neighborhood V of 5, such that the mapping, 
i4Ffl df(x), when restricted to M, is inner-semicontinuous at x. 

Proposition 10.12 (Identifiable manifolds and partial smoothness) 

Consider a Isc function f: Ft" — > R and a set M C R n , so that M is a C 2 
manifold around x and f is C 2 -smooth on M around x. Consider a subgradient 
v £ df(x). Then the following are equivalent. 

1. M is an identifiable manifold at x for v. 

2. We have 

[df] = [9(f + ^m)} locally around (x, f(x),v). 

3. — f is partly smooth with respect to M at x for v. 
— the strong inclusion v 6 ridf(x) holds. 

Given a function / : R" — > R and a parameter A > 0, the proximal mapping 
P x f : R" ->■ R is defined by 

P\f{x) ■= argmin{/(y) + ^-\y - x\ 2 } . 

The results of Section[BJ concerning projections, may be extended to the func- 
tional setting with the proximal mapping taking the place of the metric pro- 
jection. However for the sake of brevity, we do not pursue this further. 

Acknowledgements The authors thank Alex D. Iofle for the insightful conversations re- 
lated to the current work. 
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